Field Guide to Hadoop by Kevin Sitto & Marshall Presser

Authors: Kevin Sitto & Marshall Presser
Language: eng
Format: epub, pdf
Publisher: O'Reilly Media, Inc.
Published: 2015-03-18T04:00:00+00:00


Tutorial Links

The GitHub page for the parquet-format project is a good place to start if you want to learn more about how the technology works. If you would rather dive straight into examples, head to the GitHub page for the parquet-mr project.

Example Code

The Parquet file format is supported by many of the standard Hadoop tools, including Hive and Pig (both described elsewhere in this guide). Using the Parquet data format is typically as easy as adding a couple of lines to your CREATE TABLE command or changing a few words in your Pig script.

For example, to change our Hive example to use Parquet instead of the delimited textfile format, we simply refer to Parquet when we create the table:
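
A minimal sketch of such a statement follows. The table name, column names, and HDFS location are hypothetical stand-ins rather than the book's original listing, and the short `STORED AS PARQUET` form assumes Hive 0.13 or later, where Parquet support is built in.

```sql
-- Hypothetical table, columns, and location, for illustration only.
-- Assumes Hive 0.13+, where STORED AS PARQUET is natively supported.
CREATE EXTERNAL TABLE movie_reviews (
  reviewer STRING,
  title    STRING,
  rating   INT
)
STORED AS PARQUET
LOCATION '/data/reviews_parquet';
```

Compared with a delimited text table, the only change is that the `ROW FORMAT DELIMITED ... STORED AS TEXTFILE` clauses give way to `STORED AS PARQUET`. Existing text files are not converted by this statement; one common approach is to populate the new table with an `INSERT ... SELECT` from the text-backed table.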


